Authorship attribution of source code by using back propagation neural network based on particle swarm optimization

Authors

  • Xinyu Yang
  • Guoai Xu
  • Qi Li
  • Yanhui Guo
  • Miao Zhang
Abstract

Authorship attribution identifies the most likely author of a given sample from a set of known candidate authors. It can be applied not only to discover the original author of plain text, such as novels, blogs, emails, and posts, but also to identify the programmers of source code. Authorship attribution of source code is required in diverse applications, ranging from malicious code tracking to resolving authorship disputes and detecting software plagiarism. This paper proposes a new method for identifying the programmer of Java source code samples with higher accuracy. To this end, it first introduces a back propagation (BP) neural network based on particle swarm optimization (PSO) into authorship attribution of source code. The method begins by computing a set of defined feature metrics, covering lexical and layout metrics as well as structure and syntax metrics, 19 dimensions in total. These metrics are then fed to the neural network for supervised learning, with the network weights produced by a hybrid PSO and BP algorithm. The effectiveness of the proposed method is evaluated on a collected dataset of 3,022 Java files belonging to 40 authors. Experimental results show that the proposed method achieves 91.060% accuracy. A comparison with previous work on authorship attribution of Java source code shows that the proposed method outperforms the others overall, with an acceptable overhead.
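The hybrid training idea in the abstract can be illustrated with a minimal sketch. The abstract does not say how PSO and BP are coupled, so this sketch assumes one common arrangement: PSO searches for a good starting weight vector, and plain gradient-descent back propagation then refines it. The network size, the PSO coefficients, the learning rate, the two-feature toy data, and every function name below are hypothetical choices for illustration, not the authors' configuration (the paper uses 19 feature metrics and 40 authors).

```python
import math
import random

N_IN, N_HID = 2, 3                       # toy sizes (assumed); the paper uses 19 inputs
N_W = N_HID * (N_IN + 1) + (N_HID + 1)   # all weights and biases in one flat vector
OUT = N_HID * (N_IN + 1)                 # index where the output-layer weights start

# Hypothetical toy data: two normalized metrics per file, label = author 0 or 1.
DATA = [([0.1, 0.2], 0), ([0.2, 0.1], 0), ([0.8, 0.9], 1), ([0.9, 0.7], 1)]

def sigmoid(z):
    z = max(-60.0, min(60.0, z))         # clamp to avoid overflow in exp
    return 1.0 / (1.0 + math.exp(-z))

def forward(w, x):
    """Return (hidden activations, output) for flat weight vector w."""
    hid = []
    for j in range(N_HID):
        b = j * (N_IN + 1)
        hid.append(sigmoid(w[b + N_IN] + sum(w[b + i] * x[i] for i in range(N_IN))))
    out = sigmoid(w[OUT + N_HID] + sum(w[OUT + j] * hid[j] for j in range(N_HID)))
    return hid, out

def mse(w, data):
    return sum((forward(w, x)[1] - y) ** 2 for x, y in data) / len(data)

def pso(data, rng, n_particles=20, iters=60, inertia=0.7, c1=1.5, c2=1.5):
    """Global-best PSO over the weight vector; fitness is the training MSE."""
    pos = [[rng.uniform(-1, 1) for _ in range(N_W)] for _ in range(n_particles)]
    vel = [[0.0] * N_W for _ in range(n_particles)]
    p_best = [p[:] for p in pos]
    p_err = [mse(p, data) for p in pos]
    g = min(range(n_particles), key=lambda k: p_err[k])
    g_best, g_err = p_best[g][:], p_err[g]
    for _ in range(iters):
        for k in range(n_particles):
            for d in range(N_W):
                r1, r2 = rng.random(), rng.random()
                vel[k][d] = (inertia * vel[k][d]
                             + c1 * r1 * (p_best[k][d] - pos[k][d])
                             + c2 * r2 * (g_best[d] - pos[k][d]))
                pos[k][d] += vel[k][d]
            err = mse(pos[k], data)
            if err < p_err[k]:
                p_err[k], p_best[k] = err, pos[k][:]
                if err < g_err:
                    g_err, g_best = err, pos[k][:]
    return g_best

def backprop(w, data, epochs=200, lr=0.5):
    """Per-sample gradient descent on squared error, starting from w."""
    w = w[:]
    for _ in range(epochs):
        for x, y in data:
            hid, out = forward(w, x)
            d_out = (out - y) * out * (1 - out)
            # Hidden deltas are computed before the output weights change.
            d_hid = [d_out * w[OUT + j] * hid[j] * (1 - hid[j]) for j in range(N_HID)]
            for j in range(N_HID):
                w[OUT + j] -= lr * d_out * hid[j]
                b = j * (N_IN + 1)
                for i in range(N_IN):
                    w[b + i] -= lr * d_hid[j] * x[i]
                w[b + N_IN] -= lr * d_hid[j]
            w[OUT + N_HID] -= lr * d_out
    return w

def train_hybrid(data, seed=0):
    """PSO picks the starting weights, BP refines; keep BP only if it helps."""
    w_pso = pso(data, random.Random(seed))
    w_bp = backprop(w_pso, data)
    return w_bp if mse(w_bp, data) <= mse(w_pso, data) else w_pso

if __name__ == "__main__":
    weights = train_hybrid(DATA)
    print("training MSE:", mse(weights, DATA))
```

Seeding PSO with a dedicated `random.Random` instance keeps runs reproducible, and the final comparison in `train_hybrid` ensures the refined weights are never worse on the training data than the PSO result. A real attribution setup would need one output unit per candidate author and a held-out test set.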


Similar resources

Experimental and finite-element free vibration analysis and artificial neural network based on multi-crack diagnosis of non-uniform cross-section beam

Crack identification is a very important issue in mechanical systems, because a crack is a form of damage that, if it develops, may cause catastrophic failure. In the first part of this research, modal analysis of a multi-cracked variable cross-section beam is performed using the finite element method. Then, the obtained results are validated using the results of experimental modal analysis tests. In the next part, a nove...


Traffic Signal Prediction Using Elman Neural Network and Particle Swarm Optimization

Traffic prediction is crucial for traffic management. Because humans are involved in generating this phenomenon, the traffic signal is normally accompanied by noise and high levels of non-stationarity. Traffic signal prediction has therefore attracted researchers’ interest as an important subject of study. In this study, a combinatorial approach is proposed for traffic signal...


Optimization of ICDs' Port Sizes in Smart Wells Using Particle Swarm Optimization (PSO) Algorithm through Neural Network Modeling

Oil production optimization is one of the main targets of reservoir management. Smart well technology enables real-time oil production optimization. Although this technology has many advantages, optimum adjustment or sizing of the corresponding valves is still an unsolved issue. In this research, optimum port sizing of inflow control devices (ICDs) which are passive control valves ...


Optimizing the Prediction Model of Stock Price in Pharmaceutical Companies Using Multiple Objective Particle Swarm Optimization Algorithm (MOPSO)

The purpose of this study is to optimize the stock price forecasting model with a meta-innovation method in pharmaceutical companies. In this research, stock portfolio optimization has been done in two separate phases. The first phase is related to forecasting stock futures based on past stock information, that is, forecasting the stock price using an artificial neural network. The neural network used ...


Comparative Analysis of Neural Network Training Methods in Real-time Radiotherapy

Background: The motions of the body and tumor in some regions, such as the chest, during radiotherapy treatments are one of the major concerns in protecting normal tissues against high doses. By using the real-time radiotherapy technique, it is possible to increase the accuracy of the dose delivered to the tumor region by tracing markers on the patient's body. Objective: This study evaluates the accuracy ...



Journal title:

Volume 12, Issue

Pages -

Publication date: 2017